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The  many  applications  of  the  distance  matrix,  D(G),  and  the  Wiener 
branching  index,  W(G),  in  chemistry  are  briefly  outlined.  W(G)  is 
defined  as  one  half  the  sum  of  all  the  entries  in  D(G).  A  recursion 
formula  is  developed  enabling  W(G)  to  be  evaluated  for  any  molecule 
whose  graph  G  exists  in  the  form  of  a  tree.  This  formula,  which 
represents  the  first  general  recursion  formula  for  trees  of  any  kind, 
is  valid  irrespective  of  the  valence  of  the  vertices  of  G  or  of  the 
degree  of  branching  in  G.  Several  closed  expressions  giving  W(G)  for 
special  classes  of  tree  molecules  are  derived  from  the  general  formula. 
One  illustrative  worked  example  is  also  presented.  Finally,  it  is 
shown  how  the  presence  of  an  arbitrary  number  of  heteroatoms  in  tree-like 
molecules  can  readily  be  accommodated  within  our  general  formula  by 
appropriately  weighting  the  vertices  and  edges  of  G.. 
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I.  General  Introduction 

\' 

“"Contemporary  chemistry  is  focusing  to  an  ever-increasing  extent 
on  the  relationships  between  the  structure  of  molecules  and  their 
physicochemical  properties.  As  part  of  this  scenario,  much  attention 
has  been  devoted  in  recent  years  to  the  search  for  suitable 
graph-theoretical  means  of  describing  the  topological  structure  of 
a  vast  array  of  known  chemical  systems.  In  particular,  there  has  been 
widespread  usage  of  topological  graphs  and  matrices  for  the 
characterization  of  both  individual  molecular  species  and  a  variety 
of  intermolecular  interactions . "These  developments  have  brought 
in  their  wake  a  resurgence  of  activity  in  applying  combinatorics  and 
graph  theory  to  the  solution  of  chemical  problems. However,  although 
the  utilization  of  graphs  in  chemistry  has  a  time-honored  and 
comprehensive  history,7  the  adoption  of  matrices  by  chemists  is  of 
much  more  recent  vintage.  Matrices  which  have  been  employed  to  date 
for  the  characterization  of  chemical  systems  include  the  adjacency 


1.  D.H.  Rouvray,  Roy.  Inst.  Chem.  Revs  _4,  173  (1971). 

2.  I.  Gutman  and  N.  Trinajsticf,  Fortschr.  Chem.  Forsch.  42,  50  (1973). 

3.  D.H.  Rouvray,  in  Chemical  Applications  of  Graph  Theory,  ed.  A.T. 

Balaban,  Academic  Press,  London,  1977,  chap.  7. 

4.  D.H.  Rouvray  and  A.T.  Balaban,  in  Applications  of  Graph  Theory, 
ed.  R.J.  Wilson  and  L.  Beineke,  Academic  Press,  London,  1979, 
chap.  7. 

5.  N.  Trinajstic,  Chemical  Graph  Theory  I,  CRC  Press,  Boca  Raton, 

Florida,  1983,  chap.  4. 

6.  N.  Trinajsticf,  Chemical  Graph  Theory  II,  CRC  Press,  Boca  Raton, 

Florida,  1983. 


7. 


D.H.  Rouvray,  In  preparation. 
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matrix ,  the  incidence  matrix,  the  cycle  matrix,  and  the  distance 
matrix. 2”5  "0Ur  prime  focus  of  interest  here  will  center  on  the  distance 
matrix,  and  more  especially  on  its  derivation  for  the  important  class 
of  graphs  commonly  referred  to  as  chemical  trees. ®  These  graphs  have 
been  used  extensively  in  the  physical  and  biological  sciences  for  the 
depiction  and  characterization  of  many  different  structures  and  systems. 
Following  chemical  intuition,  the  usual  restriction  placed  upon  the 
subclass  of  chemical  trees  known  as  carbon  trees  (C-trees)  has  been 
that  no  vertex  in  the  tree  could  have  a  valence  exceeding  four.  In 
our  derivation  of  the  distance  matrix  for  trees,  however,  no  restriction 
of  any  kind  has  been  imposed  upon  the  types  of  trees  that  can  be 
considered.  Accordingly,  we  are  now  able  to  present  the  first,  closed 
recursion  formula  for  the  general  tree.  Our  formula  remains  valid 
irrespective  of  the  valences  of  the  vertices,  the  degree  of  branching 
present,  or  the  presence  of  weighted  vertices  or  edges  in  the  tree. 

The  distance  matrix,  D(G),  can  be  defined  for  any  graph  G 
constructed  on  a  total  of  n  vertices. ^  D(G)  is  the  real,  symmetric 
matrix  of  order  n  x  n  whose  elements,  d^j ,  are  defined  as  follows: 


D(G) 


-  0 

*  length  of  shortest  path  between 
vertices  _i  and  in  G 


As  we  are  concerned  here  only  with  tree  graphs,  the  number  of  vertices, 
n,  will  always  exceed  the  number  of  edges  in  G  by  one.  Moreover,  the 


8.  For  the  first  use  of  this  term  see:  A.  Cayley,  Phil.  Mag.  47, 
444  (1874). 

9.  D.M.  Cvetkovic,  M.  Doob,  and  H.  Sachs,  Spectra  of  Graphs,  Academic 
Press,  New  York,  1980. 
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path  connecting  any  pair  of  vertices  in  G  will  always  be  unique.  In 
the  case  of  chemical  trees,  the  diagonal  entries  are  not  necessarily 
always  zero.  The  d^£  assume  the  value  zero  only  if  the  species 
represented  are  C-trees.  Similarly,  in  certain  instances,  the  dj j 
(i  f  j)  can  also  take  nonintegral  values.  This  circumstance  arises 
whenever  heteroatoms  are  present  in  the  molecule,  for  then  appropriately 
edge-  and  vertex-weighted  trees  are  employed  to  represent  the  species. 
The  corresponding  d^^  and  d^j  values  can  be  obtained  from  general 

formulas  put  forward  by  Barysz  et  al.^0  In  such  cases,  the 

characterization  of  the  tree  structure  by  the  modified  D(G)  matrices 
is,  of  course,  no  longer  a  strictly  topological  one. 

Up  to  the  present,  numerous  applications  of  the  distance  matrix 
in  a  wide  variety  of  different  disciplines  have  been  documented.  Apart 
from  the  frequent  use  of  D(G)  as  a  foundation  for  a  number  of  topological 
indices  (discussed  in  some  detail  below),  the  matrix  has  also  been 

employed  in  several  other  chemical  contexts.  Clark  and  Kettle^  made 
use  of  D(G)  in  their  study  of  isomer  interconversions  for  ensembles 

of  stereochemically  nonrigid  molecules  such  as  PF5,  with  the  d^ j 
representing  the  shortest  path  sequence  by  which  one  isomer  could  be 


10.  M.  Barysz,  G.  Jashari,  R.S.  Lall,  V.K.  Srivastava,  and  N. 
Trinajsti^,  in  Chemical  Applications  of  Topology  and  Graph  Theory, 
ed.  R.B.  King,  Elsevier,  Amsterdam,  1983,  p.  222. 

11.  M.J.  Clark  and  S.F.A.  Kettle,  Inorganica  Chimica  Acta  201 

(1975). 


converted  into  another.  D(G)  has  also  found  applications  in  the 

modelling  of  crystal  growth^  and  in  the  study  of  crystal  vacancies, 
where  the  djj  now  represent  lattice  bonds.  By  means  of  this  approach 
it  was  possible  not  only  to  characterize  imperfections  in  crystals 
and  clusters  but  also  to  predict  vacancy  favored  positions.  Another 

application  of  D(G)  has  involved  the  prediction  of  the  mean  ^-electron 
energies  and  energy  gaps  in  conjugated  infinite  polymers  using  both 
the  HUckel  and  Pariser-Parr-Pople  approximations.!^  More  recently, 
D(G)  has  been  used  in  the  calculation  of  a  number  of  physicochemical 
parameters  for  these  polymers. 1^  Biochemists  have  utilized  D(G)  in 
the  construction  of  evolutionary  and  phylogenetic  trees, 16,17  W£th 
the  dj j  representing  the  mutation  distances  between  genes  as  estimated 
from  cytochrome  c  sequences.  The  matrix  has  also  been  employed  in 

several  other  disciplines,  including  psychology, l8  communications 
theory, 19  and  network  flow  design. 20  Finally,  and  perhaps  not 

surprisingly,  D(G)  has  also  been  the  subject  of  intensive  investigation 


12.  D.  Bonchev,  0.  Mekenyan,  and  H.  Fritsche,  Cryst.  Growth  49^,  90 

(1980). 

13.  (a)  D.  Bonchev,  0.  Mekenyan,  and  H.  Fritsche,  Phys.  Stat.  Sol. 

A55 ,  181  (1979);  (b)  D.  Bonchev,  0.  Mekenyan,  and  H.  Fritsche, 

Phys.  Stat.  Sol.  A56,  607  (1979). 

14.  (a)  D.  Bonchev  and  0.  Mekenyan,  Z.  Naturforsch.  35A,  739  (1980); 

(b)  D.  Bonchev,  0.  Mekenyan,  and  O.E.  Polansky,  Z.  Naturforsch. 

36A,  643  (1981). 

15.  (a)  D.  Bonchev,  0.  Mekenyan,  and  O.E.  Polansky,  Z.  Naturforsch. 

36A,  647  (1981);  (b)  0.  Mekenyan,  S.  Dimitrov,  and  B.  Bonchev, 

Eur.  Polym.  J.  19>  1185  <1983). 

16.  W.M.  Fitch  and  E.  Margoliash,  Science  155,  279  (1967). 

17.  W.H.  Li,  Proc.  Natl.  Acad.  Sci.  USA  _78,  1085  (1981). 

18.  H.J.  Leavitt,  J.  Abn.  and  Soc.  Psychol.  46^,  38  (1951). 

19.  R.L.  Graham  and  H.0.  Poliak,  Bell  Syst.  Tech.  J.  50_t  2495  (1971). 

20.  M.L.  Fredman,  SIAM  J.  Comput.  5_,  83  (1976). 


by  many  mathematicians . 21"25 

The  earliest  use  of  the  distance  matrix  in  chemistry  was  in  an 
implicit  form.  In  1947  Wiener^?  proposed  the  idea  of  using  a  so-called 
path  number,  w,  for  the  purpose  of  correlating  the  molecular  structure 
of  alkane  species  with  their  boiling  point,  molecular  refractivity, 
molecular  volume,  and  heats  of  isomerization  and  vaporization.  He 
defined  w  as  the  sum  of  the  chemical  bonds  existing  between  all  pairs 
of  carbon  atoms  in  such  molecules.  Wiener  also  put  forward^?  a  polarity 
number,  p,  which  was  defined  as  the  number  of  pairs  of  carbon  atoms 
separated  by  three  carbon-carbon  bonds  in  an  alkane  species.  The  boiling 
points  of  individual  alkanes,  tg,  were  assumed  to  be  given  by  a 
biparametric  linear  relationship  of  the  form: 

tg  ■  a*w  +  b*p  +  c  (1) 

where  a,  b,  and  c  are  constants  for  a  given  group  of  isomers.  The 


21.  S.L.  Hakimi  and  S.S.  Yau,  Quart.  Appl.  Math.  22.»  305  (1964). 

22.  M.  Edelberg,  M.R.  Garey,  and  R.L.  Graham,  Discrete  Math.  14, 

23  (1976). 

23.  R.L.  Graham,  A.J.  Hoffman,  and  H.  Hosoya,  J.  Graph  Theory  _1_>  85 
(1977). 

24.  R.L.  Graham  and  L.  Lovasz,  Adv.  in  Math.,  ,29,  60  (1978). 

25.  A.  Neumaier,  Eur.  J.  Comb,  ^l,  163  (1980). 

26.  J.M.S.  SimS’es-Pereira  and  C.M.  Zamfirescu,  Lin.  Alg.  and  Applic. 

44,  1  (1982). 

27.  (a)  H.  Wiener,  J.  Am.  Chem.  Soc.,  £9,  17  (1947);  (b)  H.  Wiener, 

J.  Am.  Chem.  Soc.  ££,  2636  (1947);  (c)  H.  Wiener,  J.  Chem.  Phys. 

_15,  766  (1947);  (d)  H.  Wiener,  J.  Phys.  Chem.  52,  425  (1948); 

(e)  H.  Wiener,  J.  Phys.  Chem.  52,  1082  (1948). 


two  structural  variables,  w  and  p,  have  since  been  shown  to  be  directly 
derivable  from  the  distance  matrix,  D(G).  The  path  number,  w,  la  equal 
to  half  the  sum  of  the  elements  of  D(G),  and  the  polarity  number,  p, 
equals  the  number  of  off-diagonal  elements  in  D(G)  representing  the 

distance  three,  i.e. 

w(G)  -  4  E  dij(G)  (2) 

£L 

p(G)  *  £  W3(G)  (3) 

where  W3(G)  is  twice  the  number  of  paths  in  G  of  length  three. 

Relationship  (1)  instituted  the  first  use  in  chemistry  of 

topological  distances,  i.e.  path  counts,  for  the  correlation  of 
physicochemical  properties  with  the  degree  of  molecular  branching  in 
individual  species.  Relationship  (2),  originally  due  to  Hosoya,28 

demonstrates  the  close  connection  between  w(G)  and  the  distance  matrix, 
D(G).  Nowadays,  the  Wiener  path  number  is  usually  regarded  as  the 
first  topological  index. ^  Since  many  authors  currently  refer  to  this 
index  by  the  symbol  W(G),  we  shall  also  adopt  this  convention  hereafter. 
Topological  indices  (TIs)  provide  in  general  a  convenient  mean3  of 
assigning  a  scalar  magnitude  to  any  structure  of  chemical  significance. 
Starting  from  the  chemical  graph  of  an  individual  species,  a  TI  can 
the  derived  by  applying  an  appropriate  mathematical  algorithm  or  formula 
to  the  graph.  The  resulting  scalar  descriptor  is  subsequently  employed 
for  the  mathematical  characterization  of  the  species  in  question.  Over 


the  past  decade,  a  large  cumber  of  TIs  have  been  advocated,  with  special 
emphasis  being  placed  on  those  which  can  offer  a  reliable  means  of 
reflecting  the  amount  of  branching  present  in  both  acyclic  and  cyclic 
molecular  species.  Attesting  to  the  current  high  level  of  interest 
in  this  field,  several  reviews  of  TIs  and  their  manifold  applications 
to  chemistry  have  recently  appeared. 6i29-32 

The  Wiener  index  has  been  widely  used  in  a  variety  of  guises  for 
the  correlation  of  branching  with  the  bulk  physicochemical  properties 
of  species.  As  mentioned  above,  Wiener  himself  correlated  W(G)  with 
a  number  of  different  physical  properties  of  alkane  species.  This 
work  was  elaborated  by  Platt, 33  who  developed  an  additional  graphical 
invariant  called  the  first  neighbor  sum,  f,  equal  to  the  sum  of  the 
first  C-C  neighbors  of  every  C-C  bond  in  the  species.  The  index  f 
was  used  together  with  p  and  W(G)  in  multipararnetric  equations  to  improve 
on  the  correlations  obtained  by  Wiener.  Platt  obtained  very  good 
correlations  and  rationalized  this  success  by  hypothesizing  that  W(G) 
provided  a  measure  of  the  mean  external  contact  area  of  the  molecule 
characterized.^  Altenburg^  put  forward  a  polynomial  of  the  form: 

29.  D.  Bonchev,  J.V.  Knop,  and  N.  Trinajstic,  Math.  Chem.  21  (1979). 

30.  I.  Mo toe,  A.T.  Balaban,  0.  Mekenyan,  and  D.  Bonchev,  Math.  Chem. 

JJ,  369  (1982). 

31.  D.H.  Rouvray,  in  Chemical  Applications  of  Topology  and  Graph  Theory, 
ed.  R.B.  King,  Elsevier,  Amsterdam,  1983,  p.  159. 

32.  A.T.  Balaban,  I.  Motoc,  D.  Bonchev,  and  0.  Mekenyan,  Topics  Curr. 

Chem.  114,  21  (1983). 

33.  (a)  J.R.  Platt,  J.  Chem.  Phys.  _15,  419  (1947);  (b)  J.R.  Platt, 

J.  Phys.  Chem.  £6,  328  (1952). 

34.  (a)  K.  Altenburg,  Kolloid  Zeitschr.  V78,  112  (1961);  (b)  K. 

Altenburg,  Brennstoff  Chem.  4>7,  100  (1966);  (c)  K.  Altenburg, 

Brennstoff  Chem.  47,  331  (1966). 
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n 

P(G,x)  *  V  akxk,  (4) 

k=l - 

where  ak  is  the  number  of  pairs  of  atoms  in  alkane  species  separated 
by  the  distance  k,  and  xk  is  the  indexed  variable.  Since  ak  is  simply 
half  the  frequency  number  of  the  distance  dk  in  D(G),  it  follows  that: 

n 

W(G)  »  y  ak*k  (5) 

k=l  ~ 

and  thus  another  route  to  calculate  W(G)  becomes  accessible. 

The  first  extension  of  W(G)  beyond  its  initial  application  to 

acyclic  alkane  species  was  made  by  Hosoya,28  who  proposed  a  means  of 
calculating  W(G)  for  cyclic  alkanes.  Hosoya's  idea  was  that  the  d£j 
entry  in  D(G)  be  taken  as  the  length  of  the  shortest  path  connecting 
the  _ith  and  jth  atom  in  cyclic  species.  This  idea  was  adopted  by 
Rouvray^  £n  calculating  an  index,  R(G),  which  he  defined  as  the  sum 
of  the  d^j  elements  in  D(G).  Since  from  equation  (2)  we  know  that 
W(G)  is  half  the  sum  of  these  elements,  it  immediately  follows  that 

R(G)  =  2W(G).  The  index  R(G)  was  determined  for  several  different 
types  of  species,  namely  for  members  of  the  alkane,  alkene,  alkyne 

and  arene  series,  and  correlations  were  established  for  the  boiling 
points,  melting  points,  densities,  refractive  indices,  surface  tensions, 
and  viscosities  in  each  case.  All  the  correlations  were  based  on  a 
Walker-type-^  relationship  of  the  form: 

35.  (a)  D.H.  Rouvray,  Math.  Chem.  _1,  125  (1975);  (b)  D.H.  Rouvray 
and  B.C.  Crafford,  South  Afr.  J.  Sci.  T2_*  47  (1976). 

36.  J.  Walker,  J.  Chem.  Soc.  65,  725  (1894). 
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z  -  alR(G)]B,  (6) 

where  z  represents  some  physicochemical  parameter,  and  a  and  8  are 
two  constants  to  be  determined  by  least  squares  recursion.  Good  to 
excellent  correlations  were  found  in  all  cases.  More  recently,  it 
has  been  demonstrated  by  Bonchev  et  al.^7  that  the  W(G)  values  for 
monoalkyl-  and  o-dialkylbenzenes  correlate  extremely  well  with  their 
chromatographic  retention  times.  Here  too,  a  biparametric  Walker-type 
relationship  was  employed,  and  the  results  were  interpreted  by  assuming 
that  retention  indices  depend  substantially  upon  molecular  size  and 
shape.  To  conclude  our  discussion  of  this  topic,  we  recall  that  W(G) 
has  also  been  employed  to  predict^  the  energy  relationships  existing 
in  series  of  infinitely  conjugated  polymer  systems  (vide  supra). 

The  index  W(G)  *  h  1  dj j(G)  is  important  not  only  because  of  its 
wide  variety  of  applications  and  the  many  excellent  correlations  it 
yields,  but  also  because  it  represents  the  natural  starting  point  for 
a  number  of  more  recent  TIs.  Thus  TIs  which  are  closely  related  to 
or  directly  derivable  from  D(G)  include  the  previously  mentioned  polarity 
number  of  Wiener^  and  the  first  neighbor  sum  of  Platt^  as  well  as 
the  Gordon  and  Scantlebury  index, 3®  the  distance  sum  connectivity  index 
of  Balaban,^  the  information- theoretical  indices  of  Bonchev  and 

37.  D.  Bonchev,  0.  Mekenyan,  G.  Profit,  and  N.  Trinajsti£,  J.  Chromatog. 

176,  149  (1979). 

38.  M.  Gordon  and  G.R.  Scantlebury,  Trans.  Far.  Soc.  ^0,  604  (1964). 


39.  (a)  D.  Bonchev,  A.T.  Balaban,  and  0.  Mekenyan,  J.  Chem.  Inf.  Comp. 

Sci.  2£,  106  (1980);  (b)  A.T.  Balaban,  Chem.  Phys.  Lett.  89, 

399  (1982). 
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Trinajsti^, the  atomic  structural  index  of  Randic^  used  by  Seybold 
to  investigate  the  carcinogenicity  of  aromatic  hydrocarbons^  and  their 
derivatives,^  and  the  molecular  identification  number  of  Randi(^.^ 
In  view  of  their  widespread  use  in  chemistry  and  other  disciplines, 
W(G)  values  have  been  determined  for  several  specific  classes  of 
molecules.  This  has  been  accomplished  by  means  of  appropriate  recursion 
formulas.  The  first  of  these,  due  to  Hosoya,28  showed  that  for  straight 
chain  alkane  species  W(G)  assumes  the  simple  closed  form  (n^  -  n)/6, 
where  n  is  either  the  number  of  carbon  atoms  in  the  species  or  the 
number  of  vertices  in  the  graph  G.  (We  assume  throughout  in  our 
discussion  that  all  chemical  species  are  represented  by  C-trees,  or, 
equivalently,  by  their  hydrogen- suppressed  graph.)  Following  on  from 
this  early  work,  W(G)  has  been  systematically  evaluated  for  many  systems 
of  chemical  interest  in  a  series  of  papers  by  Bonchev  and  his  associates. 
Up  to  the  present,  these  workers  have  published  recursion  formulas 
for  trees  having  unbranched  branches,^®  monocylic  structures cyclic 
structures  having  acyclic  branches,^  polycyclic  structures, ^  spiro 

40.  D.  Bonchev  and  N.  Trinajsti^,  J.  Chem.  Phys.  67,  4517  (1977). 

41.  M.  Randi£,  Math.  Chem.  5  (1979). 

42.  P.G.  Seybold,  Intn.  J.  Quant.  Chem.,  Quant.  Biol.  Symp.  JJ),  95 
(1983). 

43.  P.G.  Seybold,  Intn.  J.  Quant.  Chem.,  Quant.  Biol.  Symp.  JU),  103 
(1983). 

44.  M.  Randic,  J.  Chem.  Inf.  Comput.  Sci.  24,  164  (1984). 

45.  D.  Bonchev,  0.  Mekenyan,  J.V.  Knop,  and  N.  Trinajstic,  Croat.  Chem. 
Acta  52,  361  (1979). 

46.  0.  Mekenyan  and  D.  Bonchev,  and  N.  Trinajsti^,  Croat.  Chem.  Acta 
56,  237  (1983). 

47.  D.  Bonchev,  0.  Mekenyan,  and  N.  Trinajstic,  Intn.  J.  Quant.  Chem. 
17,  845  (1980). 


systems, ^8  and  bridged  polycyclic  structures. 49  Ho  global  recursion 
formula  has  ever  been  advanced  for  the  general  tree;  this  lacuna  we 
now  remedy  here. 

II.  Mathematical  Formalism 

The  method  we  adopt  to  determine  W(G)  for  the  general  tree  makes 
inherent  use  of  the  fact  that  for  all  trees  there  will  exist  one  and 
only  one  path  connecting  any  pair  of  vertices  in  the  tree.  It 
automatically  follows  that  this  unique  path  must  be  the  shortest  path 
between  the  vertices.  In  our  derivation  we  shall  also  exploit  the 
recursive  nature  of  trees  by  studying  the  decomposition  of  the  general 
tree  into  a  number  of  subtrees  around  a  chosen  vertex.  We  start  from 
a  completely  general  tree  structure  having  a  total  of  n  vertices;  this 
structure  we  shall  designate  by  the  symbol  T.  For  convenience  in 
visualizing  our  approach,  we  depict  T  as  shown  in  Figure  1.  One  vertex 
in  T  has  been  arbitrarily  selected  as  our  focus  of  attention;  this 
vertex  we  refer  to  as  the  root  of  the  tree  and  assign  it  the  symbol 
v;.  Emanating  from  vr  will  be  a  total  of  m  edges  which  terminate  on 
vertices  v]_,  V£,  ....,  vm.  Each  of  the  vertices  vj  to  vm  is  regarded 
as  the  apical  vertex  of  a  subtree,  as  illustrated  in  Figure  1.  These 
subtrees  will  be  designated  by  the  symbols  T^,  T£»  . ..  Tm  and  we  shall 
assume  chat  each  subtree  contains  a  total  of  nj_,  &2>  •••»  "m  vertices, 

48.  0.  Mekanyan,  D.  Bonchev,  and  N.  Trinajstic,  Math.  Chem.  93 

(1979). 

49.  0.  Mekenyan,  D.  Bonchev,  and  N.  Trinajstic,  Intn.  J.  Quant.  Chem. 

19,  929  (1981). 


respectively.  Since  each  subtree  is  of  arbitrary  size  and  complexity, 
this  type  of  decomposition  of  T  can  be  carried  out  without  loss  of 
any  generality. 

To  determine  W (G)  for  T,  account  must  be  taken  of  the  path  distances 
separating  all  pairs  of  vertices  in  T.  For  convenience  of  visualization, 
we  shall  use  the  symbol  to  denote  an  arbitrary  vertex  selected  from 
each  subtree  T^,  where  1  £  i  £  m.  Inspection  of  Figure  1  reveals  that 
three  differing  types  of  distance  need  to  be  considered  for  the 
calculation  of  W(G).  These  are:  (I)  distances  originating  from  vertex 
vr  and  terminating  on  some  other  vertex  in  T;  (II)  distances  originating 
from  a  vertex  in  subtree  T^  and  terminating  in  the  same  subtree 
T^;  and  (III)  distances  originating  from  a  vertex  q^  in  subtree  T^ 
and  terminating  on  a  vertex  qj  in  another  subtree  Tj.  By  considering 
each  of  these  distance  types  in  turn  and  deriving  an  appropriate  formula 
for  each,  we  show  how  a  recursive  algorithm  can  be  developed  which 
yields  W(G)  for  the  general  tree  T.  As  is  usual  in  developing  such 
algorithms,  we  shall  assume  that  certain  information  is  known  about 
each  of  the  subtrees  T]_  to  Tm.  In  particular,  we  assume  that  for  each 
subtree,  Tj:,  the  sums 


Si  -  £  d(v^,q^)  (7) 

“  qAe  TA  -  " 

are  known,  where  d(v£,qi)  is  the  distance  traversed  in  moving  from 

vertex  Vi  to  vertex  qi  with  1  <  _i  <  m.  We  show  how  this  information 

can  be  exploited  to  attain  the  desired  result  for  T,  as  well  as  the 
corresponding  quantity  S  for  T. 


III.  Derivation  of  W(G)  for  the  General  Tree 


The  recursion  formula  we  derive  for  the  general  tree,  T,  is 
comprised  of  three  parts,  one  for  each  of  the  three  different  distance 
types  referred  to  above.  We  now  consider  each  distance  type 
individually. 

(a)  Type  (I)  Distances 

For  the  subtree  T^,  the  sum  of  the  path  lengths  from  vertex  vr  to  vertex 
may  be  expressed  as: 


s  =  V  dCvy.q^.  (8) 

ueTi  — 

If  all  m  subtrees  attached  at  vertices  v*  (1  <  J.  <  m)  are  taken  into 
consideration,  the  total  number  of  distances  becomes: 


S(I^  *  I  (  l  d(vr,qi)).  (9) 

1*1  <UeTi 


Since  q^  e  T^,  we  know  that  in  all  cases 


d^jr.^i)  *  d(vj^.q^)  +  1.  (10) 


Accordingly,  equation  (9)  will  assume  the  form: 


S(I>  -  l  (  l  (d(vijqi)  +  1)) 
1*1  qieTi 


(11) 
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l  l  d(vi,qi)  +  l  l  i. 

j>l  qieTi  i.«l  qieli 


By  making  use  of  equation  (7)  and  the  definition  of  n^,  we  can  rewrite 
equation  (12)  as: 


S(I)  -  £  Si  +  l  n± 

i-1  ~  i-1  ~ 


(b)  Type  (II)  Distances 


The  path  lengths  here  relate  to  vertices  entirely  located  within  a 
given  subtree,  T^  As  pointed  out  in  the  previous  section,  the 
assumption  is  made  that  the  Wiener  index,  Wi,  is  known  for  each  of 
the  Ti*  Accordingly,  the  expression  for  the  sum,  will  take 

the  simple  form: 


S<H>  -  £  Wi 

i-1  ” 


(c)  Type  (III)  Distances 


To  determine  the  path  length  sum  for  paths  starting  in  one  subtree, 
Ti,  and  terminating  in  another  subtree,  Tj ,  we  shall  focus  for  simplicity 
on  the  vertices  qi  e  Ti  and  q^  e  T^.  The  distance  traversed  in 
proceeding  from  qi  to  q^  will  be: 


<*(qi.iq^)  ■  d(<u»vi)  +  d(q^,v^)  +  2 


Summing  for  all  possible  pairs  (qi,  q^)  by  making  use  of  the  relationship 


expressed  in  equation  (15),  ve  obtain  for  the  general  summation: 


gdll)  -  £  d(<ll,vi)  (n2  +  n3  +  ....  +  iijjj) 

<lleTl 


+  £  d(q2»V2)  (ni  +  n3  +  ....  +  1%)  +  .... 

<12  6  t2  ~ 


+  l  d(qn>vm)  (ni  +  n2  + - n^-i) 

<Im  e 

+  2  (number  of  pairs)  (16) 


A  closed  expression  for  the  number  of  pairs,  (<Ii»qj)»  may  be  derived 
in  the  following  way: 


Number  of  pairs  *  njn2  +  njn3  + . +  ^lnm 

+  n2«3  +  0304  + . +  n2nm 

+  ............  +  nm_ nm 

*  [(n^  +  n2  +  ....  +  nm)2  -  n2  -  n2  -  ....  -  t%2]/2 
m 

-  [(n  -  l)2  -  r  n2]/2  (17) 

i-1  - 

Equation  (16)  may  now  be  simplified  by  making  use  of  equation  (7)  thus: 


g(IIl)  m  (n  -  1  -  n^)  +  S2  (n  -  1  ■  n2)  + . +  Sm  (n  -  1  _  1%) 


m 


+  (n  -  l)2  -  £  n2 

i-1  - 


m  m 

-  V  St  (n  -  1  -  n^)  +  (n  -  l)2  -  £  n^2 
i-1  “  “  i-1  “ 


(18) 


(d)  The  Final  Result 


At  this  point  we  are  in  a  position  to  sum  the  topological  distances 
represented  by  equations  (13).,  (14),  and  (18)  to  obtain  the  result 

for  W(G)  for  the  general  tree  as  follows: 

w(g)  -  s(D  +  s<iD  +  sdn)  (19) 

Upon  substituting  into  equation  (19)  the  relevant  expressions  and 
carrying  out  some  simplifications  of  the  resulting  equation,  we  obtain 
our  final  formula  for  W(G) ,  which  we  display  here  together  with  two 
other  formulas  which  w.ll  be  needed  to  recursively  determine  the  Wiener 
index: 

m 

n(G)  -  l  +  ][  tii  (20) 

1*1  ” 

m 

S(G)  »  n(G)  -  1  +  l  Si  (21) 

i«l  “ 

m 

W(G)  -  IjSi  (n  -  nO  +  -  n^nj.  -  1)1  +  (n  -  1)2  (22) 

IV.  Derivation  of  W(G)  for  Certain  Classes  of  Molecules 

We  now  demonstrate  that  the  completely  general  formula  presented 
in  equation  (22),  can  be  readily  applied  to  certain  specific  classes 


of  molecules  to  derive  closed  recursion  formulas. 
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The  Star  Molecule 

For  a  molecule  possessing  a  graph  G  in  Che  form  of  a  scar,  as 
illusCraCed  in  Figure  2,  W(G)  assumes  a  particularly  3imple  fora.  Since 
each  of  Che  subcrees  is  now  represenCed  by  a  single  vertex,  Che  Cents 
in  equacion  (22)  have  simple  numerical  values: 

n^  *  1 
Si  -  0 
Wi  -  0 


The  equacion  for  Che  scar  molecule  chus  simplifies  Co: 

W(G)  -  (n  -  l)2,  (23) 

a  resulc  originally  obtained  by  Bonchev  and  Trina jsti/.^® 

The  Unbranched  Chain  Molecule 


As  mencioned  above,  Che  unbranched  chain  paCh  molecule  also  has 
a  W(G)  index  which  can  be  obCained  as  a  simple  closed  formula.  If 
one  end  of  Che  chain  is  taken  as  the  root,  as  illustrated  in  Figure 
3,  Che  graph  G  can  be  decomposed  into  one  subtree  containing  (n  -  1) 
vertices.  The  terms  in  equacion  (22)  are  then  given  by  the  following 
expressions,  assuming  by  induction  Che  indicated  expression  for  Wp 

m  ■  (n  -  1) 

Sx  -  1  +  2  + _ +  (n  -  2)  -  -J  (n  -  1)  (n  -  2) 

Wi  -  |  [(n  -  1)3  -  (n  -  1)] 


Substitution  of  these  expressions  into  equation  (22),  yields  the  result 
for  the  unbranched  chain  molecule: 


W(G)  -  j  (n3  -  n) 


in  accordance  with  the  results  of  Hosoya,2®  and  Bonchev  and  Trina  jsti£.^ 


The  Star  Molecule  with  Attached  Chain 


In  this  case  we  take  as  the  root  the  vertex  at  the  end  of 
chain  which  is  attached  to  the  star,  as  shown  in  Figure  4.  Let 
be  the  length  of  the  chain,  so  that  the  root  vertex  has  degree  n 
Labeling  the  subtree  at  the  root  which  comprises  part  of  the  : 


as  number  one,  we  have 


1  +  2+  ...  +  0 

i  «3  -  S) 


n^  ■  1 

-  0 

■  0 


for  i.  -  2,  . . .  ,  (n  -  £) 


as  in  the  two  cases  treated  already.  Then  from  (22)  we  calculate 


£  i 

W(G)  -  (n  -  l)2  +  n(„)  -  i  (2i  +  5)  (2) 


or  W(G)  -  (*)  (n  -  &+•  -]  +  (n  -  1)2 


Note  that  (23)  is  obtained  when  l  ■  1  and  (24)  when  l  ■  n  -  1. 


The  Star  Molecule  with  Attached  Chains 


Here  we  assume  that  the  spokes  forming  the  star  have  lengths  2^, 

. . . ,  ly  as  illustrated  in  Figure  5.  By  taking  the  central  vertex  as 

the  root,  the  graph  G  can  again  be  decomposed  into  Ic  subtree  chains 

k 

of  length  Z±,  for  i  *  1,  . ..,  k,  where  n  *  Y  Z±  +  1.  The  terms  in 

-  i_-l  - 

equation  (22)  will  bes 


ni  -  Z± 

Si  -  1  +  2  + . +  (&i  -  1)  *  (  '  ) 


and  these  lead  to  the  following  expression  for  W(G): 


-  H  Ui  +  5 

W(G)  -  l  (  )  [n  -  ~  -  ]  +  (n  -  l)2.  (26) 

i-1 


The  special  case  of  this  formula  when  all  the  chains  are  of  length 
two,  i.e.  Z±  m  2,  is  given  by  the  equations 


W(G)  »  k(n  -  3)  +  (n  -  l)2 


Since  in  this  instance  k  ■  (n  -  l)/2,  the  formula  can  be  further  simplified  to 

I 


W(G)  *  *s(n  -  l)(3n  -  5)  . 


(27) 
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The  Tree  Molecule  with  Unbranched  Branches 


Here  we  let  the  branches  of  the  tree  molecule  be  of  arbitrary 
length,  as  illustrated  in  Figure  6.  Let  us  assume  that  the  branches 
have  lengths  L^,  L2,  L^  which  are  attached  to  the  main  chain  at 

positions  l<&i  <  ...  <  l  ^  <  C,  where  C  denotes  the  length  of  the  main 
chain.  We  shall  establish  the  formula 


C+l  +  1  Lj  +  1 

W(G)  =(  )  +  Y  (  -  )+C-Y(-  ) 

3  i  3  i  2 

&i  Q  .  1.  J 

l  ^  [(f)  +  (  -  )]  +  l  Li  L1  -  **) 

i  2  jL<j. 


Y  Lf  +  1 

+  1  [Li  (  — 


L«  +  1 


i<i 


'  X  U-i  <  i. 

[Li  (  -  )  +  Li  (  1  )] 

i  -*■  2  —  2 


(28) 


by  a  double  induction  first  on  m,  the  number  of  branches,  and  then 

on  Li,  the  number  of  atoms  in  the  first  branch.  To  start,  when  m  =  0 

C  +  l 

the  equation  reduces  to  the  formula  (  3  ' )»  established  earlier  for 

a  chain  of  length  C.  Now  assume  jn  >  0  and  that  Li  1.  We  apply  (22), 
using  the  last  atom  of  branch  1  as  the  root.  In  this  case  there  is 
only  one  subtree,  so  we  have 


ni  ■  (n  -  1). 

When  only  one  subtree  is  present,  equation  (22)  for  W  becomes 

W  *  Si  +  Wi  +  (n  -  1) 


■  Wi  +  S. 


By  induction,  we  know  that  Wi  is  given  by  (28)  with  Li  replaced  by 


(Li  -  1).  The  difference  between  the  original  (28)  and  (28)  with  Li 
replaced  by  (Li  -  1)  is  (accounting  here  for  the  six  terms  respectively): 


Li  +  1  Li  Li  +  1  Li 

0  +  [(  1  3  )  -  (3X)1  +  C-[(  2  )  -  <2  )3 


£i  C  -  *i  +  1  ? 

+  [(  L)  +  (  1  )I  +  L  Li  (£j  -  *x) 

2  2  i>l  A 


v  Li  +1  Li  L<  +  1 

+  1  [L,((  ,  )  -  (  l»  +  (  -1  ,  )), 

J.>1  J  2  2  2 


which,  after  some  elementary  algebra,  is  the  same  as: 


0  +  (Ll)  +  C-Li  +  [(**)  +  ( 


*1,  ,  ,C  “*1  +  1 


+  1  L<  (*4  -  \)  +  £  [LjLi  +  (  )]. 

jL>i  .  .  i>i 


Our  proof  is  completed  by  showing  that  this  difference  agrees  with 
S.  By  a  direct  computation  we  have 


S  ■  1  +  2  +  ...  +  (L^  -  1)  (points  in  same  branch) 


+  (Li  +  1)  +  (L^  +  2)  + 


(point  of  juncture) 

+  (LL  +  £t  -  1)  (points  left 

of  juncture) 


+  (Li  +  1)  +  (Li  +  2)  + 


..  +  (Li  +  (C  -  £1))  (points  right 

of  juncture) 


+  I  I  [Li  +  ( £ j  -  £1)  +  b]  (points  on  other  branches) 

i>i  b-1  "L 


Li  +  1  *1 

(  )  +  (^i  “  l).Li  +  (  ^  )  +  (C  -  £1)  .  Li 

+  (C  Zl  +  S  +  £  [LtLj  +  (  -  £i).Lj  +  (L-*  +  l)l 

i>i  2 


2 
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where  1>  is  the  distance  from  the  main  chain  to  the  various  atoms  on 
the  j^th  branch.  Again,  using  elementary  algebra,  this  latter  expression 
is  seen  to  agree  with  our  previously  computed  difference  between  the 
original  (28)  and  (28)  with  L3  replaced  by  (L3  -  1).  Thus,  the  proof 
is  completed.  Notwithstanding  the  fact  that  this  is  a  new  result, 
it  can  be  shown  to  be  equivalent  to  the  formula  published  by  Bonchev 
and  Trina jstic though  it  is  simpler  in  format. 

We  now  apply  (28)  to  a  few  special  cases  which  have  been  considered 
individually  in  the  literature. 


The  Tree  Molecule  with  Branches  of  Length  One 


The  graph  of  the  molecule  is  illustrated  in  Figure  7.  Again  letting 
the  length  of  the  chain  be  C,  we  have,  in  terms  of  the  notation  of 
(28),  that  ra  =  2(C  -  2),  Lx  *  L2  =  • . .  *  1^  *  1,  £3  -  2,  £2  -  2,  £3  *  3, 
£4  =  3,  .  . . ,  £m  =  C  -  1,  and  so 

m  r- 1 

C+l  ~  %  £  C  -  £  +  1 

W(G)  =(  .  )+0+C-Jl+  Il()+(  7  ) ] • 2 

3  JL=1  £=2  2  2 

+  I  (£2”£^)'4+  I  2 

2<  £  1<£2  1  C_1  1  <  i<±  <  m 

*  y  (3C3  -  27C  +  32).  (29) 


This  case  is  similar  to  the  immediately  preceeding  case,  but  now 
all  the  branches  are  of  length  L  instead  of  length  one.  The  graph 
is  shown  in  Figure  8.  Again  with  C  denoting  the  length  of  the  chain, 


-24- 


we  can  use  (28)  with  m  ■  2(C  -  2),  1*1  •  1*2  “  "  ’  *  *  I'm  "  **,  £l  3  2, 
£2*2,  £3-3,  £4=3,  ••*,  £5,  *  C  -  1,  to  obtain 


m  m 

C+l  TL+l  r 

W(G)  -  (  .  )+I  (  ,  )  +  C  •  l 

i-1  ■*  i-1 


L+l 
(  2  > 


C-1  £  c  -  £  +  1  „ 

+L-  l  [(2)  +  (  2  ) ] *2  +  L2.  I  (*2  -  tl)-4 

A-2  2<£i<£2<C-l 

L  +  1  r 

+  I*  (  2  )  *  I  2 

1  <  Kj  <  m 

C+l  L  +  l  L+l  L+l 

-  (  3  )+  <2C  -  4)  ’  [<  3  )  +  C-(  2  )  +  (2C  -  5) *L* (  2  )] 


+4L[(2)+L(  3  )  ]. 


(30) 


V.  An  Illustrative  Worked  Example 


The  formulas  derived  above  relate  to  specific  classes  of  tree-like 
molecules.  When  the  Wiener  index,  W(G),  is  required  for  a  molecule 
not  belonging  to  one  of  these  classes,  resort  must  be  made  to  our  general 
formula  given  in  equation  (22).  Below  we  illustrate  how  our  formula 
is  applied  in  the  case  of  the  molecule  of  2,5-dimethyl-2-ethyl-4-propyl 
hexane.  This  molecule  has  a  graph,  G,  which  does  not  belong  in  any 
of  the  above  special  classes;  its  hydrogen- suppressed  graph  is 
illustrated  in  Figure  9. 

As  in  all  the  previously  described  special  cases,  to  determine 
W(G)  for  this  molecule  one  sets  out  by  arbitrarily  selecting  one  vertex 
in  the  graph  of  the  molecule  and  calling  this  the  root.  In  Figure 
9  the  root  vertex  we  have  selected  is  indicated  by  a  full  circle. 
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Starting  from  this  root,  one  moves  out  progressively  toward  the 
extremities  of  the  graph,  as  indicated  by  the  arrows  placed  on  the 
edges  of  the  graph.  It  is  the  outermost  vertices  which  are  tackled 
first.  For  each  vertex  the  triplet  (n^,  S^,  W^)  is  evaluated  in  turn 
starting  from  the  outermost  vertices  and  working  back  towards  the  root. 
The  value  of  the  triplet  (n^,  S^,  W^)  for  the  root,  which  will  be  the 
last  triplet  to  be  determined,  automatically  gives  the  required  value 
of  W(G)  for  the  molecule  as  a  whole.  This  is  equivalent  to  calculating 
W(G)  by  substituting  appropriate  terms  into  equation  (22)  in  a  recursive 
manner . 

Because  every  vertex  at  an  extremity  of  the  graph  G  will  have 
only  one  attached  edge,  the  triplet  (n^,  S^,  D^)  will  always  assume 
the  value  (1,0,0)  for  such  vertices  and  may  be  written  in  directly. 
All  other  triplets  need  to  be  calculated.  Considering  next  the  two 
longest  side  chains  in  the  molecule,  it  is  readily  seen  that  for  a 
vertex  attached  to  the  end  vertex  the  triplet  will  assume  the  value 
(2,1,1).  The  third  vertex  from  the  end  along  the  longest  chain  has 
the  triplet  (3,3,4).  This  is  calculated  by  summing  the  two  previously 
evaluated  n^  values,  and  by  making  use  of  equations  (20-22)  as  shown 
below: 


nj  *  1  +  ni  *1+2*3 
Si-n-l+Si-3-1+1-3 

■  Si  (n  -  n^)  +  Vi'i  -  n^(ni  -  1)  +  (n  -  l)2 
-  1- (3  -  2)  +  1  -  2-1  +  22  -  4 


(ni,  Slt  Wi)  -  (3,3,4). 
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The  other  triplets  are  progressively  evaluated  by  an  analogous  procedure 
until  the  root  vertex  is  reached.  The  values  obtained  for  each  of 
the  triplets  are  shown  in  Figure  9.  The  triplet  for  the  root  cones 
out  to  be  (13,27,258),  so  that  W(G)  *  258. 

VI.  The  Substitution  of  Carbon  Atoms  by  Heteroatoms 

We  now  consider  the  changes  that  need  to  be  made  in  our  general 

formula  (22)  when  heteroatoms  are  substituted  for  carbon  atoms  in 

tree-like  molecules.  In  purely  graph-theoretical  terms,  such 

substitution  corresponds  to  a  weighting  of  the  vertices  and  edges  in 

the  tree  graph,  G.  The  number  of  edges  and  vertices  in  G  which  can 

be  weighted  is  arbitrary;  if  necessary,  all  of  the  edges  and  vertices 

% 

can  be  weighted. 

Let  us  suppose  initially  that  just  one  heteroatom.  A,  is  substituted 
for  a  carbon  atom  In  the  tree-like  molecule  and  that  A  has  the  same 
valence  as  that  of  the  atom  it  substitutes.  The  situation  which  then 
arises  is  illustrated  in  Figure  10.  The  initial  edge  weights  for  all 
edges  emanating  f  jm  the  original  carbon  atom  we  represent  as  6^, 
d2****»dm  and  new  edge  weights  after  the  substitution  of  the  carbon 

atom  by  A  we  shall  represent  as  olf  a2 . .  a0.  We  shall  also  choose 

the  vertex  associated  with  atom  A  as  the  root  vertex.  From  equations 
(13)  and  (14),  it  follows  that: 

(31) 

£  (aj  -  d^n^2 


®new  ■  sold  +  l  n i  "  5i.) 


Wnew  "  ^old  +  n_i  (Snev  "  SQi j)  - 


+  d<vA»vA)/2 


(32) 


*  wold  +  I  (<*i  "  5i)n^(n  -  n.j_)  +  d(vA,vA)/2 
_i 

where  Snew  and  Wnew  refer  to  the  respective  values  of  S  and  W  after 
the  substitution  of  atom  A,  S0u  and  refer  to  these  values  before 

the  substitution  of  atom  A,  and  d(vA,vA)  is  the  weight  assigned  to 

atom  A  for  the  d^^  matrix  entry  in  D(G). 

From  equation  (32),  it  is  seen  that  the  substitution  of  one  carbon 
atom  by  one  heteroatom,  A,  will  produce  a  change  in  the  value  of  W(G) 
for  the  tree-like  molecule  equal  to: 

A W(G)  *  £  ( at£  -  6i)n^(n  -  n*)  +  d(vA,v^)/2  (33) 

i  ~  ~  ~ 

The  effect  of  substituting  one  heteroatom  is  thus  to  bring  about  a 

change  in  the  value  of  (a*  -  <$j.)  for  each  pair  of  vertices,  (X,Y), 
such  that  the  shortest  path  from  X  to  Y  includes  the  edge  in  question, 
plus  the  addition  of  one  half  of  the  atomic  weighting  term,  d(vA,vA) . 
The  total  number  of  pairs  of  vertices  (X,Y)  will  be  given  by  the 

expression  n^(n  -  n^).  It  follows  that  the  substitution  of  h  heteroatoms 
for  carbon  atoms  will  result  in  a  new  W(G)  value  for  the  tree-like 

molecule  equal  to: 

h 

W(G)new  *  W(G)0id  +  £  7  (ai  -$i)n£(n  -  n^)  +  £  d(vA,vA)/2  (34) 
-  -  hi  "  ~  "  ~  A-l 

From  the  above  reasoning,  it  is  evident  that  the  substitution 
of  heteroatoms  into  a  tree-like  molecule  for  which  W(G)  is  known  does 
not  neccessitate  a  de  novo  calculation  of  W(G).  The  known  W(G),  referred 
to  above  as  W(G)0id>  is  simply  modified  upon  substitution  by  the  two 
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latter  terms  in  equation  (34).  It  need  hardly  be  emphasized  that  the 
modification  of  W(G)0id  using  equation  (34)  involves  a  much  shorter 
calculation  then  a  recalculation  of  W(G)  de  novo.  To  illustrate  this 
point,  we  consider  the  molecule  shown  in  Figure  9  again  to  suppose 
now  that  the  root  atom  is  replaced  by  the  heteroatom  N.  Using  the 
tables  of  Barysz  et  al.,^0  we  know  that  the  d^£  entry  for  N  is  0.143 
and  that  the  C-N  bond  has  a  weighting  of  0.857.  In  this  case,  therefore, 
the  new  value  of  W(G)  is  given  straightforwardly  by  the  expression: 

3 

W(G)new  *  W<G)old  +  l  l  (0.857  -  1)(13  -  n±)  +  £  0.143/2 

h*l  _i«l  A*1 

*  258  -  14.586  +  —  ~  -  243.486 

Such  a  procedure  effects  a  substantial  saving  in  time  when  W(G)new 
values  are  to  be  computed. 
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Captions  to  Figures 

Figure  1:  A  representation  of  the  general  tree,  with  one  distinguished 
root. 

Figure  2:  The  graph  of  a  star  molecule  with  one  distinguished  vertex. 


Figure  3:  The  graph  of  an  unbranched  chain  molecule  with  one 
distinguished  vertex. 

Figure  4:  The  graph  of  a  star  molecule  with  attached  chain  and  one 
distinguished  vertex. 

Figure  5:  The  graph  of  a  star  molecule  with  attached  chains  with  one 
distinguished  vertex. 

Figure  6:  The  graph  of  a  chain  molecule  having  unbranched  branches 
and  one  distinguished  vertex. 

Figure  7:  The  graph  of  a  tree  molecule  having  branches  of  length  one 
and  one  distinguished  vertex. 

Figure  8:  The  graph  of  a  tree  molecule  having  branches  of  given  length 
and  one  distinguished  vertex. 

Figure  9:  Graph  of  the  molecule  of  2,5-dimethyl-2-ethyl-4-propyl-hexane 
with  one  distinguished  vertex.' 

Figure  10:  Representation  of  the  substitution  of  a  heteroatom  A  into 
the  graph  of  a  hydrocarbon  molecule. 
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